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Express Mail Label No. EL952273335US 

[000 1 ] A FAST JOINT DETECTION USER EQUIPMENT 

[0002] This application is a continuation of U.S. Patent Application No. 10/037,7 10, filed 

December 31, 2001, which claims priority to U.S. Provisional Patent Application No. 

60/287,431, filed on April 30, 2001, which applications are incorporated herein by 

reference. 

[0003] BACKGROUND 

[0004] The invention generally relates to wireless communication systems. In particular, 
the invention relates to data detection in a wireless communication system. 
[0005] Figure 1 is an illustration of a wireless communication system 10. The 
communication system 10 has base stations 12, to 12 5 (12) which communicate with user 
equipments (UEs) 14, to 14 3 (14). Each base station 12 has an associated operational area, 
where it communicates with UEs 14 in its operational area. 

[0006] In some communication systems, such as frequency division duplex using code 
division multiple access (FDD/CDMA) and time division duplex using code division 
multiple access (TDD/CDMA), multiple communications are sent over the same frequency 
spectrum. These communications are differentiated by their channelization codes. To more 
efficiently use the frequency spectrum, TDD/CDMA communication systems use repeating 
frames divided into timeslots for communication. A communication sent in such a system 
will have one or multiple associated codes and timeslots assigned to it. 
[0007] Since multiple communications may be sent in the same frequency spectrum and 
at the same time, a receiver in such a system must distinguish between the multiple 
communications. One approach to detecting such signals is multiuser detection (MUD). In 
MUD, signals associated with all the UEs 14, users, are detected simultaneously. Another 
approach to detecting a multi-code transmission from a single transmitter is single user 
detection (SUD). In SUD, to recover data from the multi-code transmission at the receiver, 
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the received signal is passed through an equalization stage and despread using one or 
multiple codes. Approaches for implementing MUD and the equalization stage of SUD 
include using a Cholesky or an approximate Cholesky decomposition. These approaches 
have a high complexity. The high complexity leads to increased power consumption, which 
at the UE 14 results in reduced battery life. Accordingly, it is desirable to have alternate 
approaches to detecting received data. 

[0008] SUMMARY 

[0009] A code division multiple access user equipment receives K data signals over a 

O shared spectrum. The user equipment receives and samples a combined signal having the 

O 

%1 K transmitted data signals. A combined channel response matrix is produced. A block 

2 column of a cross correlation matrix is determined using the combined channel response 

*S matrix. Each block entry of the block column is a K by K matrix. Each block entry of the 

U block column is a K by K matrix. A fourier transform of a complex conjugate transpose of 

W the combined channel response matrix multiplied to the combined signal samples is taken. 

HI 

O An inverse of a fourier transform of each block entry is multiplied to a result of the fourier 
5 transform to produce a fourier transform of the data vector. An inverse fourier transform 
of the data vector fourier transform is taken to produce data of the K signals. 



[001 0] BRIEF DESCRIPTION OF THE DRAWING(S) 

[001 1] Figure 1 is a wireless communication system. 

[00 1 2] Figure 2 is a simplified transmitter and a fast j oint detection receiver. 

[0013] Figure 3 is an illustration of a communication burst. 

[0014] Figure 4 is a flow chart of a preferred embodiment for fast joint detection. 

[0015] Figure 5 is an illustration of a data burst indicating extended processing areas. 

[0016] Figures 6-11 are graphs illustrating the simulated performance of fast joint 
detection to other data detection approaches. 
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[0017] DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT(S) 
[0018] Figure 2 illustrates a simplified transmitter 26 and receiver 28 using fast joint 
detection in a TDD/CDMA communication system, although fast joint detection is 
applicable to other systems, such as FDD/CDMA. In a typical system, a transmitter 26 is in 
each UE 14 and multiple transmitting circuits 26 sending multiple communications are in 
each base station 12. The joint detection receiver 28 may be at a base station 12, UEs 14 or 
both. 

[0019] The transmitter 26 sends data over a wireless radio channel 30. A data generator 
32 in the transmitter 26 generates data to be communicated to the receiver 28. A 
modulation/spreading/training sequence insertion device 34 spreads the data with the 
appropriate code(s) and makes the spread reference data time-multiplexed with a midamble 
training sequence in the appropriate assigned time slot, producing a communication burst 
or bursts. 

[0020] A typical communication burst 1 6 has a midamble 20, a guard period 1 8 and two 
data fields 22, 24, as shown in Figure 3. The midamble 20 separates the two data fields 22, 
24 and the guard period 18 separates the communication bursts to allow for the difference 
in arrival times of bursts transmitted from different transmitters 26. The two data fields 22, 
24 contain the communication burst's data. 

[002 1 ] The communication burst(s) are modulated by a modulator 36 to radio frequency 
(RF). An antenna 38 radiates the RF signal through the wireless radio channel 30 to an 
antenna 40 of the receiver 28. The type of modulation used for the transmitted 
communication can be any of those known to those skilled in the art, such as quadrature 
phase shift keying (QPSK) or M-ary quadrature amplitude modulation (QAM). 
[0022] The antenna 40 of the receiver 28 receives various radio frequency signals. The 
received signals are demodulated by a demodulator 42 to produce a baseband signal. The 
baseband signal is sampled by a sampling device 43, such as one or multiple analog to 
digital converters, at the chip rate or a multiple of the chip rate of the transmitted bursts. The 
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samples are processed, such as by a channel estimation device 44 and a fast joint detection 
device 46, in the time slot and with the appropriate codes assigned to the received bursts. 
The channel estimation device 44 uses the midamble training sequence component in the 
baseband samples to provide channel information, such as channel impulse responses. The 
channel impulse responses for all the transmitted signals can be viewed as a matrix, H. The 
channel information is used by the fast joint detection device 46 to estimate the transmitted 
data of the received communication bursts as soft symbols. 

[0023] The fast joint detection device 46 uses the channel information provided by the 
channel estimation device 44 and the known spreading codes used by the transmitter 26 to 
estimate the data of the desired received communication burst(s). 
[0024] Although fast joint detection is explained using the third generation partnership 
project (3GPP) universal terrestrial radio access (UTRA) TDD system as the underlying 
communication system, it is applicable to other systems. That system is a direct sequence 
wideband CDMA (W-CDMA) system, where the uplink and downlink transmissions are 
confined to mutually exclusive timeslots. 

[0025] The receiver 28 receives a total of K bursts that arrive simultaneously. The K 
bursts are superimposed on top of each other in one observation interval. For the 3GPP 
UTRA TDD system, each data field of a time slot corresponds to one observation interval. 
A code used for a k th burst is represented as C (k) . The K bursts may originate from K 
different transmitters or for multi-code transmissions, less than K different transmitters. 
[0026] Each data field of a communication burst has a predetermined number of 
transmitted symbols, N s . Each symbol is transmitted using a predetermined number of chips, 
which is the spreading factor, SF. Accordingly, each data field has N s x SF chips. After 
passing through the wireless radio channel, each symbol has an impulse response, such as 
of length W chips. A typical value for W is 57. As a result, each received field has a length 
of SF x N s + W - 1 chips or N c chips. 
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[0027] Each field of the K data fields in one observation interval can be modeled at 
the receiver by Equation 1 . 

r _( k ) = A ^d {k) ,k =l---K Equation 1 

[0028] is the received contribution of the k* field. A (k) is the combined channel 

response for the k* field. A (k) is a N c x N s matrix. Each j* column in A (k) is a zero-padded 
version of the symbol response S (k) of the j th element of d (k) . The symbol response S (k) is the 
convolution of the k* field's estimate response, , and spreading code C (k) for the field. 

d (k) is the unknown data symbols in the k th data field. h^ k ) is of length W chips and can be 
represented by Equation 2. 

h {k) = y(k) .£(*) Equation 2 

y^' reflects the transmitter gain and path loss. h_ is the channel impulse response. 
[0029] For uplink communications, each as well as each Y (k) are distinct. For the 

downlink, all of the fields have the same h but each / is different. If transmit diversity 

(k) T (k) 

is used in the downlink, each J and n are distinct. 

[0030] The overall received vector r from all K fields sent over the wireless channel is 
per Equation 3. 
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K 

r + n 



k=\ 



Equation 3 



n is a zero-mean noise vector. 

[003 1 ] By combining the A (k) for all data fields into a total channel response matrix A and 
all the unknown data for each burst d (k) into a total data vector d, Equation 1 becomes 
Equation 4. 

r_ = Ad_+n Equation 4 

[0032] Determining d_ using a MMSE solution is per Equation 5. 

d = R _1 [A h r ) Equation 5 

(■f represents the hermetian function (complex conjugate transpose). R for a MMSE 
solution is per Equation 6. 

R=A H A+o 2 l Equation 6 

a 1 is the noise variance, typically obtained from the channel estimation device 44, and I is 
the identity matrix. 

[0033] Using fast fourier transforms (FFTs), although other fourier transforms may be 
used, this equation is solved, preferably, per Equation 7. 
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[p(d)\ =U k) V[F(A H L)\ Equation7 



F (•) indicates the FFT function. The [] k indicate that the equation is solved at each 

frequency point k. A k ^ are block entries of size K by K of a block diagonal matrix A . The 
derivation of A is described subsequently. Instead of directly solving Equation 7, Equation 
7 can be solved using forward and backward substitution. 

[0034] Figure 4 is a flow chart for a preferred method of determining the data vector d_ 
using fast joint detection. The combined channel response matrix A is determined using the 
estimated responses and the spreading code for each burst c (t) ,48. Form the 

combined channel correlation matrix, R = A H A , 49. At each frequency point, a K by K 
matrix A (K) is determined by taking the fourier transform of block entries of a block column 
of R (block FFT), 50. Preferably, a central column is used, at least w columns from the left 
or right of the R matrix. 

[0035] F [ A H r ] is determined using a FFT of a matrix multiplication, 5 1 . The inverse 



of each A^ , 



A 



(0 



, is determined. Todetermine [F(d_)] k , A k) 1 and[f(A H r)]^ are 



multiplied at each frequency point. Alternately, [F(d_)] k is determined using LU 

decomposition. A^ is decomposed into a lower triangular matrix, L, and an upper 
triangular matrix, U, 52. Using forward, L y_ = [f (a h r_ )] , 53, and backward substitution, 
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U [F (d_ )] = y , 54, [f (d_)] R is determined, d is determined by an inverse FFT of F (d_) , 
55. 

[0036] The derivation of Equation 7 is as follows. A minimum mean square error 
solution of Equation 4 is determined per Equation 8. Although Equation 7 is a MMSE 
based solution, fast joint detection can be implemented using other approaches, such as a 
zero forcing approach. 

Rd =(A H A+<7 2 /)j =A H r_ Equation 8 

If a zero forcing solution is used, the a 2 1 term drops out of Equation 8, such as 
Rd_ = (A H A^d_ = A H r . The following is a derivation for the MMSE solution, although an 

analogous derivation can be used for a zero forcing solution. For illustrative purposes, a 
simplified example of R with N s = 10 and W = 2 is per Equation 9. This example is readily 
extendable to any N s and W. 
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Equation 9 

[0037] The matrix R is of size (KN S ) by (KN S ) in general. Each entry, R] , in the R matrix 

is a K by K block. The sub-matrix, within the dotted lines of R, is block-circulant, i.e. a 
block- wise extension of a circulant matrix. The portion of R, which is not block-circulant, 
depends on the maximum multipath delay spread, W. 

[0038] A block-circulant extension of the matrix R, R c , in Equation 9 is per Equation 10. 
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Equation 10 

[0039] A "Discrete Fourier Transform (DFT)-like" matrix D is determined such that 
R c = DMf . One such matrix D is per Equation 1 1 . 
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I K is a K by K identity matrix. 

[0040] The product D H D is per Equation 9. 



Equation 1 1 



Equation 12 



I m is a KN S by KN S identity matrix. The block circulant matrix R c is multiplied by the D 



matrix, such as per Equation 13. 
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R C D= 



(R 0 +R i H +R 2 H + 



(R 0 +R l H +R 2 H + 



(R 0 +R i H +R 2 H + 



(foe* +R 1 H e N * +R 2 "e"° + . 
R 2 e^ +R,e N ' ) 
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(R i e^+R 0 e^ +R L H e lf ' + . 
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ylOSr 
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Equation 13 

Each entry of R c D is a K by K block. A block-diagonal matrix A is per Equation 14. 



■ A d) 



A* 



A (2) 



A (3) 



A 



Equation 14 
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A is of size (KN S ) by (KN S ). Each entry of A is per Equation 15 



(0 



(0 



(i) 



Equation 15 



A^ is a K by K block and has K 2 non-zero entries. 

[0041] The D matrix is multiplied to the A matrix, such as per Equation 16. 



DA = 



j2n 

A (D A (2) e ~ 
A (D A (2) e — 



j\4n 

A (d A (2) e ^T 

j\67l 

A (D A (2) e — 



j4n 

A (3) e N * 



j2&n 

A (3) e~ 



j\6n 
A (N-D e N> 



j\\2n 



A (NJ e N s 
A (N s ) e N s 



A (N s ) e N s 



Equation 16 



Each entry of DA as shown in Equation 16 is a K by K block. 
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[0042] The system of equations produced by equating each row of R C D with each row of 
DA is consistent. Accordingly, the same set of equations result by equating any row of R C D 

with the same row of DA. To illustrate for Equation 13, the first row-block of R C D is per 
Equation 17. 



jlx j*K j^k jl&r j2dt 

[i^+R^ +R 1 +R 2 ),(R 0 e 1 ^ +R l "e N ^ +R 2 H e N > +R 2 e N > +Rfi N > ),■•-, 

j\(sz j3?x j4& ;144r jl6fr jlto ;3fo j54rt j!6gr j\m 



Equation 17 



[0043] The first row-block of DA is per Equation 18. 



j2n jAn 



[A (1) ,A (2) e N ° ,A (3) ^ ,A W "\ N > ,A w e N > ] 



Equation 18 



[0044] Equating the entries of these two rows, Equations 19 and 20 result. 



Equation 19 



jlK C /2£ j4s lbn_ j20n ") 



jln 

= e N - 



}2ft 



R 0 +R?e N < +R 2 H e N - +R 2 e +R x e N ' 
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Equation 20 
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[0045] As a result, is per Equation 21. 



jln j4n -j2n 



A (2 > = (2^+<^ + * 2 V +ty ) Equation21 

[0046] Similarly, A N ~ X) is per Equation 22. 



j2{N s -2)n j4(N-2)7i -j4(N s -2)n -j2(N-2)n 

Equation 22 



[0047] A (A ' s) is per Equation 23. 

jl(N-\)n j4(Af,-l)7T -jMN.-Dll -;2(jV,-l)^ 

Equation 23 

Although Equations 17-23 illustrate using the first row of R C D and DA, any row can be 
used to determine the A l 's . 

[0048] To illustrate using a center row, (NJ2) A row (row 5 of Equation 7), A (1 ) is per 
Equation 19. 

A (1) ={R 0 +Rf +R 2 H +R i +R 2 ) Equation 19 
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Equations 19-23 compute the FFT of K by K blocks. Since these blocks are multiplied by 
scalar exponentials, this process is referred to as a "block FFT." Typical approaches for 
calculating FFTs, such as the Matlab function "fft" compute the FFTs of a one-sided 



implemented by a fft {0,0," -,^,^,^ ) ,^,^,- • -,0,0} and multiplying it by an appropriate 
exponential function as per Equation 27 for a center row. 



[0049] As shown in Equations 17-27, calculating all the A 1 > can be performed using a 
single column of R. Accordingly, R c is not required to be determined. Any column of R 
can be used to derive the A (i) s directly. Preferably, a row at least W rows from either edge 
of R is used, since these rows have a full set of R,s. 

[0050] Using the A^ s and the D matrix, the block-circulant matrix R c can be re-written 
as Equations 28 and 29. 

R D = DA Equation 28 



sequence. Since each A^ is a two sided sequence, the computation of A 1 ' can be 




where v = 



[ceil(-f)-l] 



Equation 27 




Equation 29 



D and A are each of size (KN S ) by (KN S ). 
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[0051] Since lfD= N S I KN ,D~ l = (VN s )lf, Equation 30 results. 



R c l =N s [(D H rA-\Dy l ] = N s [^-A- 1 ^- 

s s 



] 



Equation 30 



[0052] The MMSE solution is per Equation 3 1 . 



d_ = R; 1 (a h r_ ) 



Equation 31 



The detected data vector d_ is of size (N S K) by 1. 
[0053] The MMSE solution is per Equation 32. 



D H i = A- ] [D H (A H r)] 



Equation 32 



The matrix A is of size (KN S ) by (KN S ) with K by K blocks, and its inverse is per Equation 



33. 
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A" 1 = 



(i) 



A (2) 



A (3) 



A 



-1 



(Dl-l 



[A w ] 



(2)1-1 



[A w ] 



(3)1-1 



[A w ] 



(W.)l-l 



[A w ] 



Equation 33 



The inversion requires an inversion of the K by K matrices 1 v . 
[0054] As a result, the data vector d_ is determined per Equation 34. 



[F (i)! = [A ( * ] [' [F ( A H r )\ Equation 34 



Equation 34 is applicable to both receivers which sample the received signal at the chip rate 
and at a multiple of the chip rate, such as twice the chip rate. For multiple chip rate 
receivers, the R matrix corresponding to the multiple chip rate is of the same form as 
Equation 9, being approximately block circulant. 

[0055] To reduce the complexity in determining F (a h r ) , an FFT approach taking 

advantage of the structure of A may be used. A has an approximately block-circulant 
structure. However, it is a non-square matrix, being of size (N S SF) by (N S K). An illustration 
of a matrix A is per Equation 35. 
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Equation 35 



Each ^(O is the convolution of the channel response h (k) and the spreading code c (k) , for 
the k th user at the j* chip interval of the i* symbol interval. 

[005 6] Using blocks B (•) , where each block is indicated by bracketing in Equation 35 , 
Equation 35 becomes Equation 36. 
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Equation 36 



[0057] As shown, a portion of A is block-circulant. A circulant extension of A is denoted 
as A c . 
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A can be broken into three matrices per Equation 37. 
A=D,A 1 D? 



Equation 37 



Dj is a (N S SF) by (N S SF) matrix. D 2 is a (N S K) by (N S K) matrix and A is a block-diagonal 
matrix of size (N S SF) by (N S K). 

[0058] The block diagonal matrix A 1 has the same form as A of Equation 14. However, 



each of its entries A/ 0 is a SF by K block per Equation 38. 



A,<" = 



2 (0 Q 1 



(0 



(0 



Equation 38 



[0059] D 2 is the same form as that of D in Equation 11. ^ is of the form of Equation 



39. 
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/ SjP is a SF by SF identity matrix. 



Equation 39 



flat 

[0060] In multiplying A c and D 2 , products of the form, B(i) and e N > I K , are formed per 
Equation 40. 



;2tt ;18ff j20r j18t jl62r j!80r 

[5(0)+5(l)+5(2)] [B(0)^+B(2)^ +B®e N >] - [5(0)e * +fi(2> * +5(1)^ ] 
[B(0)+5(l)+5(2)] [B®e N > +B(Q)e N * +B(2)e N > )] -• +W)e N> +B(2)e N > ] 



4A 



jifa jm jWn jut* ;i62r j\m 



[B(0)+£(l)+5(2)] [5(2)^ +flfl> ff ' +BQJ)e N ' ] - [£(2)e +B(l)e K +B(0)e K ] 



Equation 40 
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A C D 2 is of size (N S SF) by (N S K) and each block is a size SF by K. 

[0061] In multiplying D 1 and A t , products of the form, e N ° I SF and A{ 1) , are formed. 

Z}Aj is of size (N S SF) by (N S K) and each block is of size SF by K. Comparing any row of 
4 A with the same row of £W Equation 41 results. 



A, w = [5(0) + 5(1) + 5(2)], 

A/ 2) = [5(0) + B{\)e N > + B(2)e N > ], 

- j2(N,-2)x - j\(N s -2)n 

A/"' -0 = [5(0)+ B(l)e N ' +B(2)e N > ], 

- }2(N s ~\)n - j4(N s -\)tz 

A, (WJ = [5(0) + B(l)e N ' + B(2)e ]. 



Equation 41 

[0062] As a result, each A/ can be determined using the FFT of a one-sided sequence 

of (SF by K) blocks. Using Equation 38 and D 2 H D 2 = N s W Equations 42, 43 and 44 
result. 

A^DjAjZ)/, Equation 42 

A h L = dX(dI I l) Equation 43 
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D 2 H ( A H r_ ) = N s ■ [aJ (D , h r )] Equation 44 



[0063] Accordingly, |f (a w r j] is determined using FFTs per Equation 45. 



Equation 45 



Similarly, since the matrix A is approximately block-circulant, R = A H A + <r 2 / can also be 
computed using FFTs using A, . 

-l 



[0064] To reduce complexity, the inversion of each A^ , 



A 



(0 



, can be performed 



using LU decomposition. Each [a] is a (K by K) matrix whose LU decomposition is per 
Equation 46. 

A (i) = LU Equation 46 

[0065] L is a lower triangular matrix and U is an upper triangular matrix. Equation 7 is 
solved using forward and backward substitution per Equations 47 and 48. 

X^W^Kt)! Equation 47 



V =| A (;)] H [F(^ Equation 48 
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[0066] Preferably, to improve the bit error rate (BER) for data symbols at the ends of 
each data fields 22, 24, samples from the midamble portion 20 and guard period 1 8 are used 
in the data detection as shown in Figure 5. To collect all the samples of the last symbols in 
the data fields, the samples used to determine r are extended by W-l chips (the length of 

the impulse response) into the midamble 20 or guard period 18. This extension allows for 
substantially all the multipath components of the last data symbols of the field to be used for 
data detection. As shown for data field 1 22, the samples are extended into the midamble 
by W-l chips. The midamble sequences are cancelled from the samples taken from the 
midamble 20 prior to data detection processing. For data field 2 24, the samples are 
extended into the guard period 18 by W-l chips. 

[0067] Certain FFT implementations required a certain field length for analysis. One of 
these FFT implementations is a prime factor algorithm (PFA). The PFA implementation 
requires the field length to be a prime number, such as 61. To facilitate PFA FFT 
implementation, the samples used to determine r are preferably extended to a desired PFA 

length. As shown in Figure 5, data field 1 or data field 2 are extended by P chips to the 
desired PFA length. Alternately, block FFTs of 61 symbols are extended to block FFTs of 
length 64, which requires T FFT calculations. Since the approximation of R to a block 
circulant matrix is reduced, the performance typically improves. 

[0068] An analysis of the computational complexity of fast joint detection is as follows. 
The computational complexity of calculating A is K-SF-W . The computational 
complexity of calculating A H A is per Equation 49. 



(k 2 +k) (k 2 -k) 

l——l[2(SF +W -1)-(» B „ -Op^-^-^H^ -0 

where n raax =mm(N s ,(SF +W -\)/SF)+1) Equation 49 
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[0069] Calculating (A H r) A as a matrix-vector multiplication has a computational 
complexity of K N s (SF + W-l). Calculating the FFT of a j th column-block of R requires 

K 2 -(N S log N s ) calculations. Calculating the Fourier transform of A " r requires K( N s 

log2 N s ) calculations. The inversion of each matrix without Cholesky 

decomposition, requires K 3 calculations. For N s frequency points, the total number of 

calculations is N s K 3 . Calculating [f (d_)] k = |a ( * ) ] [f ( A H r requires (K 2 ) multiplications 

for N s frequency points. Accordingly, the total number of calculations is N s K 2 . The Inverse 
FFT of [F(d)] requires K( N s log2 N s ) calculations. 

[0070] To illustrate the complexity for fast joint detection, the million real operations per 
second (MROPs) for processing a TDD Burst Type I with N c = 976, SF = 16, K = 8, N s = 
61 and W = 57 chips is determined. The calculations A, (A H A), a column-block of R, 

[ft k) ] A are performed once per burst, i.e. 100 times per second. The calculations A 11 !, F 
[A"! ], computing [f (d)] k and the inverse FFT of [F(d)] are performed twice per burst, i.e. 

200 times per second. Four calculations are required to convert a complex operation into 
a real operation. The results are illustrated in Table 1 . 



Functions executed once per burst MROPS 

Calculating A 3.0 

Calculating A H A 4.4 

Calculating F([R]j ] 9.2614 

Calculating [A®! -1 12.4928 
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Functions executed twice per burst 
Calculating A H r 
Calculating F[A H r] 



28.11 



2.3154 



Calculating [F(#)\ =[A (fc) F 1 [F(A fl r)] fc 



3.1232 



Inverse FFT of [F(d A )] 2.3154 
Total number of MROPS required for fast joint detection 65 .01 82 



Table 1 



Note: in Table 1, (A 1 ^) was calculated directly as a matrix- vector multiplication. 



54.8678 MROPS. If FFTs are used to determine (aV), the complexity reduces from 

65.0182 MROPS to 63.9928 MROPS. 

[0072] A comparison of the complexity of fast joint detection and other detection 
techniques is as follows. The complexity of the following three techniques for a TDD Burst 
Type I with SF = 16 and K = 8 is per Table 2. 

Technique MROPS 
Approximate Cholesky based Joint Detection, (JDChol) 82.7 

Single User Detection: Approximate Cholesky based 
Equalization followed by a Hadamard Transform based 
Despreading (SDChol) 205.2276 



[0071] If LU decomposition is used to determine A k > , the complexity reduces to 
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Fast Joint Detection (JDFFT) 65.0182 

Table 2 

[0073] The performance of the three detection techniques and a reference matched 
filtering (MF) data detection technique were compared by simulation over 800 timeslots. 
The simulations used the precision provided by Matlab, i.e. no finite precision effects were 
considered. The simulations used channels specified by WCDMA TDD WG4; SF =1 6 and 
K = 8 and 12 and were performed for the downlink with no transmit diversity to facilitate 
comparison to SUD. 

[0074] As shown in Figures 6 and 7, respectively, for cases 1 and 3, the performance of 
fast joint detection, JDFFT, is very close to that of the Cholesky based Joint Detection, 
JDChol. The other data detection approaches did not perform as well as JDChol or JDFFT. 
For the tddWg4Case2 channel as shown in Figure 8. JDFFT shows some degradation 
compared to JDChol. It also performs similarly to the SUD based Cholesky algorithm, 
SDChol. For a high data rate service, such as a 2 Mbps service, as shown in Figures 9-11. 
JDFFT performs close to or slightly worse than JDChol and better than the other approaches . 

* # * 
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